Universality for the distance in finite variance random graphs: 

Extended version 



Henri van den Esker* Remco van der Hofstad ' Gerard Hooghiemstra* 

May 19, 2008 



Abstract 

We generalize the asymptotic behavior of the graph distance between two uniformly chosen 
nodes in the configuration model to a wide class of random graphs. Among others, this class 
contains the Poissonian random graph, the expected degree random graph and the generalized 
random graph (including the classical Erdos-Renyi graph). 

In the paper we assign to each node a deterministic capacity and the probability that there 
exists an edge between a pair of nodes is equal to a function of the product of the capacities 
of the pair divided by the total capacity of all the nodes. We consider capacities which are 
such that the degrees of a node has uniformly bounded moments of order strictly larger than 
two, so that, in particular, the degrees have finite variance. We prove that the graph distance 
grows like logj, N, where the v depends on the capacities. In addition, the random fluctuations 
around this asymptotic mean log„ N are shown to be tight. We also consider the case where 
the capacities are independent copies of a positive random A with P (A > x) < cx 1 ~ T , for some 
constant c and r > 3, again resulting in graphs where the degrees have finite variance. 

The method of proof of these results is to couple each member of the class to the Poissonian 
random graph, for which we then give the complete proof by adapting the arguments of [13] . 

1 Introduction 

Various papers (see e.g., [U [HI [I8j [20]) study properties of random graphs with a given degree 
sequence. Among such properties as connectivity, cluster size and diameter, the graph distance 
between two uniformly chosen nodes is an important one. For two connected nodes the graph 
distance is defined as the minimum number of edges of a path that connects these nodes. If the 
nodes are not connected, then the graph distance is put equal to infinity. 

For the configuration model (see Section 1.4 for a definition) a distance result appeared in |13j . 
when the distribution of the i.i.d. degrees -D (C) satisfies 

P(D (C) > x) < cx l - T , (1.1) 

for some constant c, all x > 0, and with r > 3. We use the superscript (C) to differentiate between 
models. The result in [13] states that with probability converging to 1 (whp), the average distance 
between nodes in the giant component has, for 

E[^)(^)-l)] 

E[J)M] ' 1 ' 

bounded fluctuations around log^ N . The condition v > 1 corresponds to the supercritical case of 
an associated branching process, and is the condition under which a giant component exists. 
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In this paper we extend the above distance result to a wide class of random graphs. Models 
which fall in this class are the generalized random graph (GRG), the expected degree random graph 
(EDRG) and the Poissonian random graph (PRG). All three models will be introduced in more 
detail below. 

The method of proof is coupling. It is shown that the distance result holds for all models in 
the general class if and only if the result holds for the PRG (Section 2). In Section 4 we prove 
the distance result for the Poissonian random graph. This proof is parallel to that in [13] for the 
configuration model. In this paper we included full proofs of the auxiliary lemmas contained in 
Section El since details of these proofs are different from those in [13] . 

1.1 Model assumptions 

The graph models considered here are static models, meaning that the number of nodes is fixed. 
The graph G N has N nodes, numbered 1,2, ... ,N. Associated with the nodes is a sequence {Aj}^ 
of positive reals. We call Aj the capacity of node i; nodes with a large capacity will obtain a 
high degree, whereas nodes with small capacity will only be incident to a limited number of edges. 
Furthermore, we define 



i.e., l N is the total capacity of all nodes of the graph G N . 

The binary random variables {Xij}i<i<j<jy, are defined by setting Xij = 1, if the edge between 
node i and node j in the graph G N is present, otherwise we set X^ =0. If i > j, then by convention 
Xji = Xij. We call {X^} the connection variables and pij = P (X^ = 1) the connection probability 
of edge ij. In this paper we consider graphs G N that satisfy the following two assumptions: 

Al: The connection variables {Ajj}i<j <:; <7v, are independent. 

A2: The connection probability py, for 1 < i < j < N, can be written as p^j = h(\Xj/l N ), for 
some function h : [0, oo) — ► [0, 1], satisfying 



1.2 Three special cases 

We give three examples of random graph models, which satisfy assumptions Al and A2, and hence 
fall in the class of models considered here. 

The first example is the Poissonian random graph (PRG), which is a variant of a model intro- 
duced by Norros and Reittu in [20]. The second random graph model, which we call the expected 
degree random graph (EDRG), is introduced by Chung and Lu in [TJ [8]. The third and last example 
is the generalized random graph (GRG), which was introduced by Britton, Deijfen and Martin-L6f 



We now define the three models and verify that each of them satisfy the conditions Al and 
A2. 

• Poissonian random graph: In [20] the Poissonian random graph (PRG) was introduced. 
The main feature of such a graph G^f is that, conditionally on the capacities, the number 
of edges between any pair of nodes i and j is a Poisson random variable. The model in [20] 
is introduced as a growth model, but as a consequence of [23 Proposition 2.1], it can be 
formulated as a static model, and we will do so. Start with the graph G# consisting of N 
nodes and capacities {Aj}^. The number of edges between two different nodes i and j is 
given by an independent Poisson random variable E?- with parameter 



l N = Ai + A 2 + • • • + A 



(1.3) 




(1.4) 



in@. 




(1.5) 
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Denote by 1a the indicator of the set A. The connection variables are then X { -P = 1 . „(p) , 
so that, for 1 < i < j < iV, the connection probabilities are given by 

pg> = P(A<f = 1) = P(4 P) > 0) = 1 - eM-kXj/ln) = h (p) (\i\j/l N ), 

where /i (F) (x) = 1 — e~ x . Obviously, h {F) {x) — x = 0[x 2 ), for x [ 0. Since, by definition, the 
random variables {N^^i^i^KN are independent, we conclude that the assumptions Al and 
A2 are satisfied. 

It should be noted that in this paper we define the PRG using a deterministic sequence of 
capacities, while Norros and Reittu start with an i.i.d. sequence of random capacities. The 
case where the capacities {A,}^ 1 are i.i.d. random variables, satisfying certain tail estimates, 
is a special case of our set up and is studied in more detail in Theorem 11.41 

• Expected degree random graph: In [8] a random graph model is introduced starting 
from a sequence of deterministic capacities {\}f =1 . We construct the EDRG G® as follows. 
Let {X^ }i<i<j<N be a sequence of independent Bernoulli random variables with success 
probability 

where x Ay denotes the minimum of x and y. This minimum is to ensure that the result is a 
probability. 

Assumption Al is satisfied by definition, since the connection variables are independent 
Bernoulli variables, and assumption A2 is also satisfied if we pick = x A 1. 

If we assume that Xi\j/l N < 1 for all 1 < i < j < N, then the expected degree of a node i is 
given by Xf. 



pjf = P [X[f = 1 = (AjAj/Zjv) A 1, for 1 < i < j < N, 



E 



N 



N 

'y ] \ Xj /i N = X{. 



The Erdos-Renyi random graph, usually denoted by G(N,p), is a special case of the EDRG. 
In the graph G(N,p), an edge between a pair of nodes is present with probability p £ [0, 1], 
independently of the other edges. When p = X/N, for some constant A > 0, then we obtain 
the graph G(N, X/N) from the EDRG by picking A.; = A, for all i, since then p\j = \Xj/l N = 
X/N = p, for all 1 < i < j < N. 



Generalized random graph: The generalized random graph (GRG) is an adapted version 
of the EDRG, see the previous example. We define G^ with A^ nodes as follows. The sequence 
of connection variables is again given by a sequence of independent Bernoulli random variables 
{ X ij } }i<i<j<N with 

(G) -t\ _ (G) _ AjAj/ljy 



v 1 + XiXj/l N ' 

In [6] the edge probabilities are given by (XiXj/N)/(l + X{Xj/N), so that we have replaced 
Xi/N 1 / 2 by \/l l J 2 , 1 < i < N. This makes hardly any difference. 

Again, the assumptions Al and A2 are satisfied. To satisfy assumption A2 we pick h {G) {x) = 
x/(l +x) = x + 0(x 2 ). 

1.3 Main results 

We state conditions on the capacities {Aj}^, under which our main result will hold. We shall need 
three conditions, which we denote by (CI), (C2) and (C3), respectively. 
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(CI) Convergence of means: Define 



1 V A 

^=mJ2 X ^ and ^ = ^-\ (1-6) 

then there exist constants jj, £ (0, oo), 1/ G (1, oo) and ai > such that 

l/ijv-H = 0{N-^), \v N -v\ = 0(N~°*). (1.7) 

(C2) Convergence of branching processes: Define 

/W = i- y e - A * ^ and = V , 



N N n+1 

A and 5r = irr-Fe- Ai ^V, (li 

iV ^ n! n Nu N ^ n! v 

i=l i=l 



then there exist sequences {/ n } n >o and {g n }n>o> independent of N, and ct2 > such that 

d TV (f m J) = 0{N~ a *) and d TV ( 9 w , 5 ) = 0(iV- a2 ), (1.9) 

where d TV (-, ■) is the total variance distance, i.e., for probability mass functions p = {pj} and 
Q = {Qj}- 

d TV (p,<7) = -Qj\- (1-10) 

j 

(C3) Moment and maximal bound on capacities: There exists a r > 3 such that for every 
e > 0, 



1 N 

limsup — V Ap 1_e < oo, and A#° = max A* < iV 7 , (1.11) 

where, 



7 = ^- + £<l/2. (1-12) 

T — 1 



It is not hard to see that fj, N and v N in (jl.6p are the means of the probability mass functions 
{fn' ) }n>o, {ffrT^n^O) respectively. Thus, (CI) is equivalent to the fact that the means of {/n^jn^Oj 
{dn f) }n>o converge. It turns out that (CI) is equivalent to the convergence of the first and second 
moment of the degree of a uniform node. 

Condition (C2) says that the laws {/n }n>0: {<?n }n>0 are close to certain limiting laws 
{/n}n>0; {dn}n>o, which shall turn out to be crucial in our proofs, since it allows us to use coupling 
to branching processes. 

The second bound in Condition (C3) gives an upper bound on the maximal capacity of a node, 
while it can be seen that the first inequality is equivalent to the statement that a uniform node has 
a uniformly bounded moment of order at least r — 1 — e. Since r > 3, we can pick e > so small 
that t — 1 — e > 2, so that the degrees have finite variances. 

We shall prove our main results in the generality of Conditions (C1)-(C3), but shall give special 
cases when Conditions (C1)-(C3) following our main results. 

In order to be able to state our main results, we define the process {Z t } t >o as a branching 
process (BP) starting from Z = 1, where in the first generation the offspring distribution is equal 
to {/„}, whereas in the second and further generations the offspring is chosen in accordance to 

{g n }- 

We define the average graph distance or hopcount H N between two different randomly chosen 
nodes A\ and A2 in the graph G N as the minimum number of edges that form a path from the 
node A\ to node A2 where, by convention, the distance equals 00 if the nodes A± and A2 are not 
connected. 
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Theorem 1.1 (Fluctuations of the graph distance) Assume that the capacities {Xi}^L 1 sat- 
isfy (ClJ-fCS,), and let the graph Gn, with capacities {Aj}^ 1 satisfy Al and A2, for some func- 
tion h : [0,oo) 1— > [0, 1]. Let <tjv = [log^^l an d a N = Civ — ^°Su^- There exist random variables 
(-R a )a6(-i,01 such that, as N — » 00, 

F(H N = a N + k\H N <oo)=F(R aN = k) + o(l), k = 0, ±1, ±2, . . . . (1.13) 

We identify the random variables (R a ) a e(-i,o] in Theorem II .31 below. Before doing so, we state 
one of the consequences of Theorem 11.11 

Corollary 1.2 (Concentration of the graph distance) Under the given assumptions of The- 
orem [T7J[ 

• with probability 1 — o(l) and conditionally on H N < cxd, the random variable H N is in between 
(lie) log„ N for any e > 0; 

• conditionally on H N < oo, the sequence of random variables H N — \og v N is tight, i.e., 

lim limsupP(|iI w -log^iV] < K\H N < 00) = 1. (1.14) 

We use a limit result from branching process theory to identify the limiting random variables 
(i?a)ae(-i,o] • ^ is well known, see [HJ P- 244], that the process {^ t ///i/* _1 } t >i is a non-negative 
martingale and consequently converges almost surely to a limit W: 

2 

lim — = W, a.s. (1.15) 

Let W (1) and W (2) be two independent copies of W in (|1.15p . then we can identify the limit 
random variables (-R a ) aG (_i j o] as follows: 

Theorem 1.3 For a G (-1,0], 

¥{R a > j) = E[exp{-Kis a+j W w W (2) } |W (1) W (2) > 0] , 

where k = fj,(u — 1) . 

A special case of the above theorems is the case where 

X l = (l-F)- 1 (i/N), i = l,2,...,A, (1.16) 

where F is a distribution function of a positive random variable, i.e., F(0) = 0, F is non-decreasing 
and lirn^oo F(x) = 1, see (jB.ip for a definition of (1 — i* 1 ) -1 . In Appendix B, we will formulate quite 
general conditions on a distribution function F such that (C1)-(C3) hold with {\}f =l defined by 
(|1.16p . The special cases 

C 1 /C\1/(t-1) 

F(x) = l--^ I , for which (l-F)- 1 (y) = (-) 

where r > 3, extends the results obtained by Chung and Lu [7] from H N / \og y N — — > 1 to the study 
of the fluctuations of H N . 

Instead of assigning deterministic capacities to the nodes one can also assign random capacities. 
Associated with the nodes is a sequence {Aj}^ 1 of positive i.i.d. random variables, with distribution 

F A (x) = P(A < x) . 

Then, we set, for 1 < i < N, 

Xi = Aj. 
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Figure 1: Empirical survival functions of the graph distance for r = 3.5 and for four values of N. 
Each plot is averaged over 1000 samples. 



For the i.i.d. case, we can identify [i,v,f and g appearing in conditions (C1)-(C3) as 



(j, = E[A] 



E[A 2 ] 
E[A] 



/„ = E 



A" 



■;?■! 



and 



9n 



1. 



-E 



A n+1 



■III 



(1.17) 



for n > 0. 

The next theorem states that results of the deterministic capacities carry over to the random 
capacities: 

Theorem 1.4 Given an i.i.d. sequence of random variables {Aj}^ 1; with common distribution 
function F\ . If there exist constants c > and r > 3 such that 



1 - F A (x) < cx , for all x > 



J1.18) 



and with u, given by (jl,17p . satisfying v > 1, then there exists an event J , which occurs with high 
probability, such that conditionally on {Aj}^ satisfying J , the conditions (C1)-(C3) hold. 

The proof of Theorem 11.41 is given in Appendix[Cj We have the following corollary to Theorem ll.4t 

Corollary 1.5 In the case of i.i.d. capacities, with common distribution function F\ satisfying 
(|1.18p and with v > 1, the results of Theorem Corollary li.ffl and Theorem \1.3\ hold with high 
probability. More precisely, for every k = 0, ±1, ±2, . . the random variable 



P(H N = a N + k 




P(H N <oo 


Mix) 



•(Ra N =k) 



(1.19) 



converges in probability to zero. 



We demonstrate Theorem 11.11 for the i.i.d. case using Corollary 11.51 Assume that 

1 - F A (x) = cx l ~ T l {x>xo} , 

with r = 3.5, c = 2.5981847 and x = 0.7437937, then v = E[A 2 ] /E[A] w 2.231381. We can pick 
different values of the size of the simulated graph, so that for each two simulated values N and M 
we have ajy = «m- As an example, we take N = Mv 2k , for some integer k. This induces, starting 
from Nq = M = 5000, by taking for k the successive values 0,1,2,3 

Nq = M = 5000, N! = 24895, N 2 = 123955, and N 3 = 617181. (1.20) 
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Observe that ajv fc = 0.6117... for k = 0,1,2,3. According to Corollary 11.51 the survival function 
of the graph distance H N , run approximately parallel on distance 2 in the limit for N — > oo, since 
log„ Nh = \og v M + 2k for i = 0,1,2,3. In Figure Q] we have simulated the survival function of 
the graph distance for the graphs with sizes with k = 0,1,2,3, and, indeed, the plots are 
approximately parallel on distance 2. 

A general version of inhomogeneous random graphs with independent edges is presented in [5]. 
Our choice of graphs is a special case of the rank-1 case treated in [5j Section 16.4]. In the general 
setting of [5], the vertices are given by C X, for some state space X, and there is an edge 

between vertices Xi and Xj with probability 

Pij = mm{K(xi,Xj)/N, 1}, (1-21) 

where k : X x X — > [0, oo) is a suitable kernel. The rank-1 case of [5] is the case where k is of product 
form, 1.6. ; Ki(^X<i^ X j ) = il){xi)il){xj) for some tp: X — > [0, oo). In fact, it is even possible that k depends 
on N , I.e., t\i — Ki j\f , such that k n converges to some limit as N — » oo in a suitable way. This allows 
one to simultaneously treat the cases where pij = 1 — e~ K \ x i> x i)/ N or py = n(xi, Xj)/ (N + n(xi,Xj)) 
(recall the special cases in Section [L2l) . In [5], various results are proved in this generality, such 
as the phase transition of the resulting graphs, and distance results, such as average distances and 
the diameter. The main tool is a deep comparison to multitype Poisson branching processes. 

In particular, [5J Theorem 3.14] states that for suitable bounded n N , the average distances 
between uniform pairs of connected nodes is equal to (1 + o(l)) log u N, for a certain v > 1. The 
condition v > 1 corresponds exactly to the random graph being supercritical (see [5j Theorem 3.1]). 
This average distance result generalizes the first result in Corollary 11.21 apart from the fact that in 
our setting we do not assume that k n is bounded. In fact, in certain be unbounded. 

In our paper, we state conditions on ip in the rank-1 case of [5] under which we can identify the 
fluctuations of the average distances. It would be interesting to generalize our main results to the 
general setting of [5], i.e., to study the fluctuations of H N in the general setting of [5]. However, 
it is unclear to us how we should generalize the tail assumption in (|1.18p to this inhomogeneous 
setting. 

I. 4 Relations with the configuration model 

The configuration model (CM) appeared in the context of random regular graphs as early as 1978 
(see [21 [E]). Molloy and Reed [T7] were the first to use the configuration model with specified 
degree sequences. Here we consider the CM as introduced in [13]. Start with an i.i.d. sequence 
{D^}fL 1 of positive integer valued random variables , where £)• will denote the degree of node i. 
To built a graph it is mandatory that Df ] + Df ] + ... + is even, so if Df ] + Df ] + ... + 
is odd we increase by one, which will have little effect. We build the graph model by attaching 
Df stubs or half edges to node i and pair the stubs at random, so that two half edges will form 
one edge. 

In [13], the authors prove a version of Theorem II .1111.31 for the configuration model. Theorems 

II. 1111.31 hold for the configuration model with only two changes: 

1. Replace the condition v > 1 in Theorem 11.11 which is implicitly contained in (CI) by the 
condition v > 1, defined in (jl.2p . 

2. Replace the offspring distributions of the BP {Z t }t>o, by 

(a) 

f n = ¥(D^ = n), n>l, 
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(b) 



9n = 



(n + l)/ n +l 

E[D^] 



n>0. 



For the configuration model, the setting is as in Theorem 11.41 where, for the CM, the degrees are 
chosen in an i.d.d. fashion. The result in [13] proves that when we pick two uniform nodes, that 
their distance, when averaged out over the randomness in the i.i.d. degrees, satisfies (|1 . 13|) . A 
version in (|1.19p . which holds conditionally on the random degrees, is not proved in [13j . and is 
stronger than (|1.13p . We conjecture that a version of (|1.19p also holds in the CM, when the degrees 
satisfy appropriate conditions that would take the place of (C1)-(C3). 

One wonders why a result like Theorems 1.1-1.3 holds true for the class of models introduced 
in Section 1.1, especially if one realizes that in the CM the degrees are independent, and the edges 
are not, whereas for instance in the GRG (and in the other two examples) precisely the opposite is 
true, i.e., in the GRG the edges are independent and the degrees are not. To understand at least 
at an intuitive level why the distance result holds true, we compare the configuration model with 
the generalized random graph. 

By construction the degree sequence Df\ D^, • • • , D^ of the CM is an i.i.d. sequence, and 
conditionally on V = {D^ = di,D^ = d2,---,D^ = d N }, the graph configuration is uniform 
over all configurations satisfying T>, because the pairing is at random. Hence if we condition on both 
the event T> and the event S = {the resulting graph has no self- loops and no multiple edges}, then 
the CM renders a simple graph, which is picked uniformly from all possible simple configurations 
with degree sequence satisfying V. Since for N — > oo the probability of the event S converges to 
exp(-P/2 - P 2 / 4 ) > (see [31 p. 51]), it follows from [161 Theorem 9.9] that properties that hold 
whp in the CM also hold whp in the conditioned simple graph. Hence a property as tightness of 
the graph distance in the CM is inherited by the conditioned simple graph, with the same 
degree sequence. This suggests that also the limiting distribution of the fluctuations of the graph 
distance in the CM conditioned on S is the same as the one in the CM as identified in [T3J. A 
direct proof of this claim is missing though. 

On the other hand the GRG with given degree sequence d\ , di , . . . , d N is also uniform over 
all possible (simple) configurations. Moreover [6J Theorem 3.1] shows that the degree sequence 
D j G) , , . . . , D# of the GRG is asymptotically independent with marginal distribution a mixed 
Poisson distribution: 



where -Fa is the capacity distribution. Hence starting from D[ , D 2 , • • • , D N as an i.i.d. sequence 
with common distribution given by (|1.22p . the (conditioned) CM with these degrees is close to the 
GRG, at least in an asymptotic sense, so that one expects that the asymptotic fluctuations of the 
graph distance of the CM also hold for the generalized random graph. Also note from the mixed 
Poisson distribution (jl.22p . that 



which is equal to u, according to (|1.17|) . As said earlier, a proof of this intuitive reasoning is missing, 
and our method of proof is by coupling each random graph satisfying Al and A2 to the Poisson 
random graph (PRG), and by giving a separate proof of Theorem 1.1-1.3 for the PRG. 

We finish this section by giving an overview of different distance results in random graphs. Let r 
denote the exponent of the probability mass function of the degree distribution. In this paper and in 
[71 [13] the case r > 3 is studied. Results for 2 < r < 3 for various models appeared in [7[ [T4" l [TOl [20] . 
Typically in that case, the distance fluctuates around a constant times 2 log log N/\ log(r — 2)|. For 
1 < r < 2, there exists a subset of nodes with a high degree, called the core (see [TO]). The core 




k = 0,1,2... 



(1.22) 



E[D (C) (D (C) - 1)] _ E[A 2 ] 
E[D^] ~ E[A] 
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forms a complete graph and almost every node is attached to the core and, thus, the graph distance 
is whp at most 3. 



I. 5 Organization of the paper 

The coupling argument that ties the fluctuations of the graph distance in the PRG to the 
fluctuations of the graph distance in random graphs satisfying assumptions Al and A2 is treated in 
Section [2j In Section 4 we show that the fluctuations of the graph distance is given by Theorem 

II. 11 The derivation of the fluctuations of the graph distance Hffi is similar to the derivation of the 
fluctuations of the graph distance in the configuration model, see [13]. The proof in [13] is 
more complicated than the proof presented here for the PRG model, mainly because in the latter 
the expansion of a given node e.g., the nodes on a given distance, can be described by means of 
the so-called Reittu-Norros process, a marked branching process. This branching process will be 
introduced in Section [3l 

In this paper full proofs of the auxiliary propositions and lemmas introduced in Sections [3] and 
H] are presented in Appendix [Dj These proofs were omitted in [9] . 

2 Coupling 

In this section we denote by G N the PRG and by G' N some other random graph satisfying the 
assumptions Al and A2, given in Section fl.ll We number the nodes of both G N and G' N from 1 
to N and we assign the capacity \, for 1 < i < N, to node i in each graph. We denote by H N and 
H' N the graph distance between two randomly chosen nodes A\ and A2, such that A\ 7^ A2, in G N 
and G' N , respectively. We will show that for N — » 00, 



The above implies that whp the coupling of the graph distances is successful. Therefore, given the 
succesful coupling (|2.ip . it is sufficient to show Theorem II .11 for the PRG. 

The coupling bound in (|2.ip has since been considerably strengthened by Janson in [15], who 
studies when two random graphs with independent edges are asymptotically equivalent. By [151 
Example 3.6] and under the assumptions Al and A2, we have that the probability of any event 
A N is asymptotically equal for G N and G' N when N — > 00. 

2.1 Coupling of G N and G' N 

We next describe the coupling of the connection variables of the graphs G N and G' N . A classical 
coupling is used, see e.g. [21]. Denote by {Xij}x<i<j<N an d {^ij}i<i<j<N the connection variables 
of the graphs G N and G' N , and, similarly, denote the connection probabilities by {pij}i<i<j<N and 
{Pij}l<i<j<N- For the coupling we introduce independent random variables {-fQj}i<i<j<iv- Set 
p. . = min{pij , p'ij} and p^ = max{py,^ }, and define random variables X%j and X[- with 



whereas all other combinations have probability 0. Then the laws of Xy and X[- are the same as the 
laws of Xij and X'-, respectively. Furthermore, K{j assumes the value 1 with probability \pij —p[j\, 
and is otherwise. Note that we do abuse the notation in the above display. We should replace 
the probability measure P in the above display by some other probability measure Q, because the 
probability space is defined by the graphs G N and G' N , instead of only the graph G N . Since the 




(2.1) 
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graphs, conditioned on the capacities, are constructed independently from each other, this abuse 
of notation is not a problem. 

Consider the nodes i and j, 1 < i < j < N, in the graphs G N and G' N simultaneously. Then 
the event {Kij = 0} = {Xij = X'^} corresponds to the event that in both graphs there exists a 
connection between nodes i and j, or that in both graphs there is no connection between nodes 
i and j. The event {Kij = 1} = {Xij ^ X'-} corresponds with the event that there exists a 
connection in one of the graphs, but not in the other one. We call the event {K^ = 1} a mismatch 
between the nodes i and j. 

Assumption A2 implies that for some constant C > 0, 

A? A 2 

P (Kij = 1) = \ Pij - p' %3 \ < \ Pij - \\j/l N \ + 1^ - XiXj/l N \ < (2.2) 

for all 1 < i < j < N. The number of mismatches due to all the nodes incident to node i, 1 < i < N, 
is given by 

K t = Y,Kij. (2.3) 

Obviously, we cannot couple all the connections in the graphs G N and G' N successfully, but the 
total number of mismatches due to all the nodes can be bounded from above by any positive power 
of N. To this end, we define the event A N as 



A 



TV r TV ^ 

n { k ^>c N} = 0} = e k ^>c N } = , (2.4) 

i=l U=l J 

where c N = N^, with £ > 0. Then, on the event A N , all nodes with capacity greater than c N are 
successfully coupled. 

Lemma 2.1 For each £ > there exists a constant 9 > such that 

F(A C N ) = 0(N-°). (2.5) 
Proof. We bound P (A%) using Boole's inequality, and the Markov inequality: 

TV TV 

{Ai>Cjv}- 

(2.6) 

i=l i=l 

Then, using (|2.2p . (|1.6j) and (|1.7j) . the expectation ¥\Kj\ can be bounded by, 



E[ Ki ] = £E[i^.] = E F (^ = !) < ^JT Yl A i = = °( A ^ 1 ) ( 2 - 7 ) 

3+1. 1+1. J=l 



Using the above and (|2.6p . we have that 

AT , TV 



F(A%) < ^[Ki] l {Xl>CN} = <?(^ £ A 2 l {Ai>C]v} ). (2-8) 

i=l i=l 

Observe that, 

TV TV TV 

i=l i=l i=l 

where we applied (jl.lip in the last step. Pick 9 = £(r — 3 — e), then combining the above display 
and ([231), that 

¥(A%) = 0(c- {r - 3 - £) ) = 0(N- e ). 

□ 
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2.2 Coupling the graph distances of G N and G' N 

In this subsection we couple the graph distance of the PRG with any random graph satisfying 
assumptions Al and A2. 

Theorem 2.2 Let G N be a PRG and let and G' N be a random graph satisfying assumption Al and 
A2. Let H N and H' N be the graph distances between two different uniformly chosen nodes A\ and 
A2 in, respectively, the graphs G N and G' N . Then 

P [H N ^ H' N ) = o(l). (2.9) 

In order to prove Theorem 12.21 we wm use the following strategy. We know that for the PRG 
the random variable H N is concentrated around log u N. Hence if we take t N = [(^ + rj)\og v N\, 
with n > 0, then P (H N > 2t N ) is small and in order that H N ^ H' N on the set where {H N < 2t N }, 
there must have been at least one mismatch between two nodes i and j, both being not on a greater 
graph distance from our random node A than 2t N . 

We define the neighborhood shells of a uniformly chosen node A £ {1, 2, . . . , N}, i.e., all nodes 
on a fixed graph distance of node A, by 

3M = {A} and dM k = {1 < j < N : d(A, j) = k}, (2.10) 

where d(i,j) denotes the graph distance between nodes i and j, i.e., the minimum number of edges 
in a path between the nodes i and j. Furthermore, define the set of nodes reachable in at most t 
steps from root A by 

t 

Mt = {l<J<N: d(A,j) < t} = \J dN k . (2.11) 

fc=0 



Proposition 2.3 For N sufficiently large, t E N, and every b € (0, 1) ; 

P(H N ^H' N ) <F(A c N )+F(H N >2t) + 2tF(\N t -i \ > N b ) + 0(tN- 1+h c%) . (2.12) 



Before giving a proof, we show that Theorem 12.21 is a consequence of Proposition 

Proof of Theorem [231 By Lemma EH we have that, P(^) < N~ e . From Corollary O 
applied to the PRG model, we obtain that P (H N > 2t N ) = o(l). The third term on the right side 
of (|2,12j) can be bounded using the following lemma: 

Lemma 2.4 Let {Mt}t>o be the reachable sets of a uniformly chosen node A in the PRG G N . Then 
for rj, 5 € (—1/2, 1/2) and all t < (1/2 + 77) logj, N , there exists a constant (3\ > such that 

P (\M t \ > N 1 ' 2 ^ = (D((\og v N)N~^). (2.13) 

Proof. See the proof of Lemma ID.2I □ 

We now prove that all terms in the right hand of (|2.12p are o(l) for an appropriate choice of 
b. Lemma El implies that 2t N F (\M t \ > N b ) = o(l) for some appropriately chosen b > ^. Then, 
provided that b < 1, we see that ^^-14 = tNN 4$+b-l = ^ 1 ) ! where 

we substitute c N = N^, 

and picking £ S (0, (1 — 6)/4). Hence, by Proposition 12.31 P (H N ^ H' N ) = o(l), which is precisely 
the content of Theorem 12.21 □ 
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Proof of Proposition 12.31 We use that 

P (H N 7^ H' N ) < P (A%) + P (H N > 2t) + P < 2i} n A N n {# w ^ ffjj) . (2.14) 

Let A/f and A/f\ for i = 1,2, be the union of neighborhood shells of the nodes A{ in G N and G' N , 
respectively. Now, we use the fact that if H N < 2t and if H N ^ H' N , then A/ (1) ^ 7V/ (1) and/or 
A" t (2) 7^ A" t /(2) . By the exchangeability of the nodes, we have 

P ({H N < 2t} nA N n {H N + H'J) < 2P ({Af t ± TV/} n A N ) . (2.15) 

If Af t 7^ A"/, then there must be a A; G {1, . . . , t} for which A4 / Af' k , but A4-1 = -^I-i- Thus, 

t 

P < 2t} nA N n {H N + H' N }) < 2 F ({A/fc # A^} n {A4-i = Ad} n Ar) . (2.16) 

k=l 

In turn, the event {Af k 7^ A"^} D {Afc_i = A^_ 1 } implies that one of the edges from dAf k -i must be 
miscoupled, thus Ky = 1 for some i G dAfk-i and j G J^k-v wnere -^fe-i = {1)2,..., N}\Af k -i- 
The event .4.^ implies that Xi,Xj < c N . Therefore, we bound 

p ({A4 + K) n {A4-i = A^_J nA N )<r (|jv fc _i| > /a 6 ) 

+ ({» G a/V fc -i, j G Afk-i,Kij = 1} n {|A4-i| < iV 6 }) l^,^}- (2.17) 

Since i G A/'^_ 1 and j G dAfk-i, the event {-Kjj = 1} is independent of Afk-i and, therefore, 
from dAfk-i as dAfk-i C Af k ~i- The edge between the nodes i and j points out of Af k -i, while 
Afc_i is determined by the occupation status of edges that are between nodes in Afk-2 or pointing 
out of d Afk-2- Thus, we can replace each term in the sum of (12.17P by 

P(A^- = 1)P ({i G SA4-1, j G A£_i} n {\Af k -t\ < iV 6 }) 1{A,,A,<^}- (2-18) 
Since by (12. 2|) . we have 

P(A^- = 1) l {A! , Aj < Civ} < C'^l {Xi ^ CN} = 0{ciN- 2 ), (2.19) 
we can bound the right side of (|2.17p from above by 

P (|A4-i| > iV 6 ) + 0(c%N~ 2 ) ({» e dMk-iJ g A/£_i} n {|Ar fc _!| < iV 6 }) . 

Finally, we bound the sum on the right side by 

({* G cWfc-l, i g A4 C _ X } n {\Af k -i\ < iV 6 }) < NE 

Therefore, we can bound each term in the sum of (|2.16p by 

p ({A4 + M' k ] n {A4-i = A/jU) n A N ) < P (|A4-i| > iv 6 ) + 0(4Ar 1+b ). 

Since, for k < t, we have that P (|A4_i| > A^ fe ) < P (|A/i-i| > N b ), by summing over k = 1, . . . , t 
in (|2.16|) . we arrive at 

P ({H N < 2t} HA N n {H N H' N }) < 2tF (\Af t -i\ > N b ^j + 0(10%^^). 

Together with (|2.14p this proves the proposition. □ 



IW-ilW^i^v*,} 



< N 



1+6 
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3 The Poissonian random graph model 



The proof of the fluctuations of the graph distance in the CM in |13] has been done in a number of 
steps. One of the most important steps is the coupling of the expansion of the neighborhood shells 
of a node to a BP. For the PRG, we follow the same strategy as in [13], although the details differ 
substantially. 

The first step is to introduce the NR-process, which is a marked BP. The NR-process was 
introduced by Norros and Reittu in [20]. We can thin the NR-process in such a way that the 
resulting process, the NR -process, can be coupled to the expansion of the neighborhood shells of 
a randomly chosen node in the PRG. Finally, we introduce capacities for the NR-process and the 
NR -process. 



3.1 The NR-process and its thinned version 

The NR-process is a marked delayed BP denoted by {Z t , M t }t>o, where Z t denotes the number of 
individuals of generation t, and where the vector 

M t = (M t>1 , M t , 2 , . . .,M t .z t ) G {1, 2, . . . ,Nf% 

denotes the marks of the individuals in generation t. We now give a more precise definition of the 
NR-process and describe its connection with the PRG. We define Z = 1 and take M 01 randomly 
from the set {1, 2, . . . , N}, corresponding to the choice of A\. The offspring of an individual with 
mark m € {1, 2, . . . , N} is as follows: the total number of children has a Poisson distribution with 
parameter A m , of which, for each i £ {1, 2, . . . , N}, a Poisson distributed number with parameter 

(3.1) 

bears mark i, independently of the other individuals. Since Yld=iK\n/^N = \n, and sums of 
independent Poisson random variables are again Poissonian, we may take the number of children 
with different marks mutually independent. As a result of this definition, the marks of the children 
of an individual in {Z t , M t }t>o can be seen as independent realizations of a random variable M, 
with distribution 

P (M = m) = j 11 , l<m<N, (3.2) 

I (V 



and, consequently, 



N N 



m=l m=l 



For the definition of the NR -process we start with a copy of the NR-process {Z t , M t }t>o, and 
reduce this process generation by generation, i.e., in the order 

M , 1 ,M lil ,...M 1 ,z 1 ,M 2A ,... (3.4) 

by discarding each individual and all its descendants whose mark has appeared before. The process 
obtained in this way is called the NR -process and is denoted by the sequence \Z t , M t \t>n- One of 
the main results of [20] is Proposition 3.1: 



Proposition 3.1 Let {Z t , M t }t>o be the NR -process and let M t be the set of marks in the t—th 
generation, then the sequence of sets {M_ t }t>o has the same distribution as the sequence {dMt}t>o 
given by ()2.1Qj) . 
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Proof. In [20] it is assumed that the sequence of capacities is random, but in most proofs, 
including [20j Proposition 3.1], Norros and Reittu condition on the capacities, and therefore consider 
the capacities as deterministic. Thus, the proof of [20\ Proposition 3.1] holds verbatim. □ 

As a consequence of the previous proposition, we can couple the NR-process to the neighborhood 
shells of a uniformly chosen node A E {1,2, ... , N}, i.e., all nodes on a fixed graph distance of A, 
see (|2.10p and note that A ~ M, u . Thus, using the the above proposition, we can couple the 
expansion of the neighborhood shells and the NR -process in such a way that 

M t = dM t and Z t = \dAf t \, t > 0. (3.5) 

Furthermore, we see that an individual with mark m, 1 < m < N, in the NR- process is identified 
with node m in the graph G N , whose capacity is given by A m . 

The offspring distribution / (JV) of Z x , i.e., the first generation of {Z t }t>o, is given by 

1 N \ n 

=F(P 0l (X A ) =n) = -J2 e ~ A ™^' n * 0. (3.6) 

m=l 

Recall that individuals in the second and further generations have a random mark distributed as M, 
given by (|3.2p . Hence, if we denote the offspring distribution of the second and further generations 
by gn i then we obtain 

N >„ . -, N 



\n \ i iv \n+l 

g^ =F(Poi(\ M ) =n) = J2 e " Am ^T7^ = 77 E e ~ Am ^T> n * °- ( 3 - 7 ) 

m= 

Furthermore, we can relate g^ and by 



n\ l N l N ^ n\ 

m=l m=l 



9 " " ln/N („ + !)!- • (3 ' 8) 

It follows from condition (C2) that /„ — > f n and g„ — > g n . 
3.2 Coupling with a delayed BP 

In this subsection we will introduce a coupling between the NR -process and the delayed BP {Z t } t >o, 
which is defined by condition (C2) in Section 11.31 This coupling is used in the proof of Theorem 
II .11 and [L3l for the PRG, to express the probability distribution of H N in terms of the BP {Z t } t >o- 

Introduce the total capacity of the t-th generation of the NR-process {Z t , M t }t>o and the 
NR -process {Z t , M t }t>n as, respectively, 

_ Z t _ z t 

C t+1 = ^ A(M M ) and C t+1 = ^ A(M t , i ) , t > 0, (3.9) 

i=l i=l 

where, to improve readability, we write X(A) = A^. Using the coupling given by (|3.5p . we can 
rewrite the capacity as 

C t+1 = £ V (3.10) 

For the proof of Theorem 11.11 and 11.31 m tli e case of the PRG, we need to control the difference 
between C t and C_ t for fixed t. For this we will use the following proposition: 
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Proposition 3.2 There exist constants a 2 ,^2 > 0> such that for all < rj < a 2 and all t < t N = 
(l/2 + V )log,N, 

P (^2(Ck -£*)> iV 1/2 ~° 2 ^ < N-P 2 . (3.11) 

Proof. The proof is deferred to Section ID. 11 □ 

In order to prove Theorem 1 1 . 1 1 and Theorem 1 1.3 1 we will grow two NR- processes {Z\ % \ M^ }t>0j 
for i = 1,2. The root of {Zy , My }t>n starts from a uniformly chosen node or mark £ 
{1,2,..., N}. These two nodes are different whp, because 

V(A 1 = A 2 ) = ± 

By (13. 5f) the NR-process can be coupled to the neighborhood expansion shells {Af^} t >o and 
{J\fl }t>o- In the following lemma we compute the distribution of the number of edges between 
two shells with different subindeces, i.e., Af^ and Aft ■ 

Lemma 3.3 Fix integers k andt. Then conditionally onAfu and Aft and given that Mi C\Aft = 
the number of edges between the nodes in Af^ and 7V f is distributed as a Poisson random variable 
with mean 

k*ip±l. (3.12) 

In 

Proof. Conditioned on Af£\ TV/ 2 ' and Af^ ] n A/"/ 2> = 0, the number of edges between Aft and 
Aft is given by 

E E 4 P) > ( 3 - 13 ) 

i&dMf> jeSA/; (2) 

where E^p are independent Poisson random variables with mean \Xj/l N , see (ll.5p . Sums of 
independent Poisson random variables are again Poissonian, thus f)3. 13j) is a Poisson random variable 
with mean the expected value of (|3.13j) : 

I'll <o\ N N 

where we have used (|3.10p in the last step. □ 

The further proof of Theorems I1.HI1.3I crucially relies on the following technical claim: 

Proposition 3.4 There exist constants u 2 , v 2 , rj > such that for all t < t N = (1 + 2rj) logj, N, as 
N — > oo, 

fc=2 fc=2 

Proof. The proof is deferred to Section ID. 21 □ 

In the next section we will use this proposition in combination with Lemma [3.31 to replace sums 
over capacities, which do not depend on N, by sums over sizes of a BP, which no longer depend on 
N. 
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4 Proof of Theorem O and Q for the PRG 



In this section, we prove Theorem 11.11 and 11.31 for the PRG model. Using the coupling result in 
Proposition 12.31 we obtain Theorem 11.11 and 11.31 for all random graphs satisfying the assumptions 
Al and A2. As in the previous section, we denote by G N a PRG. 

We grow two NR-processes. Each NR- process starts from a uniformly chosen node A; € 
{1,2,..., N}, i = 1, 2, such that A 1 ^ A 2 , whp. 

Step 1: Expressing ¥(H N > t) in capacities. We have H N > 1 iff (if and only if) there are 
no edges between the nodes A\ and A 2 . Given the capacities C^~ ) and Cf\ the number of edges 
between the nodes A\ and A 2 has, according to Lemma 13.31 a Poisson distribution with mean 



cj l) cf > 



Therefore, 



(H N > 1) 



E 



cxp 



(4.1) 



(4.2) 



We next inspect the capacity of the first generation of Zj given by Cj . Given and Cj , 
that is the total capacity of the nodes in and the capacity of node A 2 , we again have a Poisson 
number of edges between node A 2 and the nodes in however, this time with parameter 



(4.3) 



In order to compute the survival probability F(H N > t) we need more notation. We write 
Qgi.*2) £ or conc litional probabilities given {Cl^lfcLi an( ^ {^l^lfcLi- We further write E^ 1 '* 2 ' for 
the expectation with respect to Q^, 1 '' 2 '. For t 2 = 0, we only condition on {Cj 1 /I 1 — Lemma | 
implies that 

( C^C^ 

Q ( c' t] (H N > k + t - 1\H N > k + t - 2) = exp 

Indeed, the event {H N > k + t - 2} implies that n A/" t (2) 
statement, 



From (|4.1|) and the above 



¥(H N > 2) 



^[Q^\H N > l)Q% 1} (H N > 2\H N > 1)] 
E[Q^\H N > [Q^\H N > 2\H N > 1)]] 

EfE^' 1 ' [Q^Offjv > 1) 
f C (1) C (2> 



E 



E 



exp 



exp 



f^\H N > 2\H N > I)]] 
C (1) C (2) 



• exp 



jy 



^(l) n C. 

Zuk=2 i^rfc/ai— l 



fc/2J 



By induction we obtain as in |13|. Lemma 4.1], 

F(H N >t) = E 



exp 



(4.4) 



Note that (|4.4p is an equality, while in |13j an error needed to be taken along. 

Step 2: Coupling with the delayed BP. In this step we replace C\ l) and Cf ) by iJ t (1) and Z t (2) . 
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For each event B, and any two nonnegative random variables V and W, 



|E[e 



E[e~ w ] | < \E[(e- v - e~ w )l B ] \ + P (B c ) 



Now take 



r 1 t+i i+i 
r — ) I "V 7 (2) - V r< (2) 

I fc=2 fc=2 



< N 



-u 2 



and the random variables V and as 

t+i 



, t+i 1 t+i 



^(2) 
■Lfc/2J • 



fc=2 



k=2 



Then for t < t N , the Proposition 13.41 implies that P(£> c ) = C(A r_?;2 ), whereas on the event B we 
have |V — W\ < N~ U2 . Hence, using that e~ v — e~ w = 0(y — w) when v,w are small, and that 
eT v < 1, v > 0, we obtain 



\E[e~ v ] - E[e~ w ] \ < 0(N- U2 )P(B) + P(B C ) = 0{N~ U2 ) + 0(N~ V2 ). 



(4.5) 



It is now clear from step 1, the above result and by replacing l N by Nfj,, using (|1.7|) . that for some 
P > and all t < i^, 



F(H N > t) = E 



cxp 



v^i+l 57(1) 57(2) 



+ 0(JV~0). 



(4.6) 



Step 3: Evaluation of the limit points. From this step on, the remainder of the proof of our 
main theorem is identically to the proof of Theorem 1.1 in |13j . To let the proof be self-contained, 
we finish the main line of the argument. Starting from (14, 6h we replace t by a N + t and assume 
that a N + t < t N , where, as before, a N = [logj, N\ , to obtain 



F(H N >a N + t)=E 



cxp 



^crjv+t+l 57(1) 57(2) 

2^k=2 ^Tfe/21 ^Lfe/2j 



+ 0(N-> 3 ). (4.7) 



We write N = u lo ^ N = u aN ~ aN , where we recall that a N = [log„ N\ - log u N. Then 

v^crjv+t+l 57(1) 57(2) ^a N +t+l 57(1) 57(2) 

l^k= 2 ^Tfe/21 ^Lfe/2J _ a N +t2^h 



fJ,N ^ fJ?V°N+t 

In the above expression, the factor v aN prevents proper convergence. 

Without the factor pv aN+t , we obtain from Appendix A4 of [13J that, with probability 1, 

V^CTjV+t+l -7(1) -7(2) VUWWP) 

N^oo ^V^N+t V — 1 

We now use the speed of convergence result of p], which was further developed in Section 2 of [13] 
and which reads that there exists a positive (3 such that: 

P(\W-W k \>(logN)- a ) = 0(N-f 3 ), k< L^log.iVj. (4.9) 
for each positive a. Combining (14. 7D and (14. 9p we obtain that for each a > and for t < 2i] logj, iV, 
P(i? JV > cr.v + t) =E[exp{-Ki/ a,v+ '>ViW2}] +0((logA^)- a ). (4.10) 

From (|4.1U|) one can finally derive as in [p3], that, asymptotically as N — > 00, the probability 
P(H N < 00) is equivalent to the probability q 2 = P(WiW2 > 0), where q is the survival probability 
of the branching process {Z t } t >o, so that (|4.10p induces for t < 2r]log u N, 

P(H N <a N + t\H N < 00) = E[l - exp{-Ki/ a ^ + *WiW 2 }|WiW 2 > 0] + o(l). (4.11) 
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A Appendix: Mixed Poisson distributions 

In this section, we study mixed Poisson distributions. 

Lemma A.l Consider a mixed Poisson distribution with mixing random variable Qf. Let, for 
n = 0,1,..., 

f n = F(Poi(O f ) = n), 
and consider the sequence {g n }n>o, where 

(n + l)/n+l 
9n E[0/] ' 

then {g n }n>o is a ^ so a mixed Poisson distribution with mixing random variable Q g , where 



E[6/] 



Proof. We will assume that (|A.1|) holds and show that P (Poi(Q g ) = n) = g n . Let Ff and F g 
be the distribution functions of 0/ and © 9 , respectively. Observe that 



dF g {x) x dFf(x) 
dx ~ E[6/] dx 



Therefore, 



poo r ii i poo r n+l 



n+1 f~ _ r x"+' (n + l)/„ +1 



which proves the claim. □ 

As an example, we consider the random variable 0y, that takes the value Aj, i = 1, 2, . . . , TV, 
with probability pi, then, using Lemma [A. 11 @g takes the value Aj, i = 1, 2, . . . , N, with probability 

P(0W = (A.2) 

We close this section with a lemma that relates condition (C3) to a condition on the capacities 
{A,}f =1 : 

Lemma A.2 (Moments of {<?n }n>0 an d {Aj}^) Xei g > 1, and assume that (CI) holds. 
Then, 

oo N 

limsup n q g < ^ ) < oo if and only if limsup — A? +1 < oo. (A. 3) 

Proof. Observe from (jl.8p . that 

n=0 i=l n=0 i=l 

Since q > 1, we have that, by the Holder inequality, 

E[Poi(Ai) 9 ] >E[Pm(Ai)] 9 = Af. (A.5) 
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As a result, we obtain that 

N oo 

^£ A ? +1 ^E™ 9 <T- (A.6) 

i=l n=0 

By (CI), lim sup^r.+oo /"jv = fJ> < oo, so that indeed limsup^v^oo jy- Z)i=i < 00 follows when 
limsupjv^^ X^^Lo n9 5 f " V) < 00 • This proves the first implication. 

For the second implication, we make use of the fact that there exists a constant c > such that, 
uniformly for all A > 0, E[Poi(X) q ] < c(X q + 1). Then, by (|X4]) . we have that 

00 JV 1 N 

E "W S ^ E MAf + 1) = c + ^ E A?"- (A.7) 

n=0 i=l j=l 

By (CI), liminfTv-^oo fJ-N = > 0, so that when limsupjy^^ j? Y2i=i < 00 > we obtain 

00 1 ^ 

limsup Vn 9 ^ < c + - limsup — V X Q+1 < 00. (A.8) 

This proves the second implication, and hence completes the proof. □ 



B Appendix: Deterministic capacities 

Let F be any distribution function, and write F(x) = 1 — F(x) for the survival function of F. 
Define the inverse -F _1 (tt), for u G (0, 1), by 



F-\u) =inf{s : F(s) < u}, 
where the definition is chosen such that 

F _1 (l -u) = F -1 (ii) = inf{x : F(x) > u}. 



(B.l) 



(B.2) 



We shall often make use of (|B.2p . in particular since it implies that F 1 (U) has distribution function 
F when U is uniform on (0,1). Throughout this section we use the abbreviations u N = \uN~\/N 
and U N = \UN]/N. 

In this section, we let the capacities {Aj}^ be given by 



\i = F- 1 (i/N), 



1,2, 



(B.3) 



where F: [0, 00) 1— > [0, 1] is a distribution function satisfying (|1.18|) . We shall verify that conditions 
(C1)-(C3) hold under appropriate conditions on F, the main condition being (|1.18p . but we shall 
need one more technical assumption. 

We let the random variable Q ? takes the value Aj, i = 1, 2, . . . , N, with probability Pi = jj, and 
note that the distribution of 0^ equals 0y = F _1 ({7 ]V ), where U has a uniform (0, 1) distribution. 
We start by identifying /, / (iV) , g and g (Ar) . We observe that, in this setting, 



C(JV) 



E[F(Poi(e { p 



We define 



/n=E 



n)] = E 



lfTT\n 



=1-1 



f -1 (i/) F (U) 



E[F(Poi(@ f ) = n)], 



(B.4) 
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where Of = F~ l {U). Observe that the used U is the same in the definition of Of, thus we 
implicitly introduced a coupling between the random variables Of and Of. Then, using that 
0/ > @f a.s., since u \—* F~ l (u) is non-increasing, 

1 oo 

5 E l/^ " /nl = d TV (P 0i (0^),Poi(O f )) < P(P O i(0f ) / P^(0/)) 



(Poi(0/ - 0$f°) > 1) < E[0/ - 0^] = E [f-^tO - F^fUn)} , (B.5) 



2 

71=1 



where the first inequality in the above chain is the coupling inequality, the second the Markov 
inequality. 

We first investigate the convergence of the integrals E[(0^ ) 1 ] to E[(0j)*], for i = 1,2, and 
then prove that this is sufficient for conditions (C1)-(C3). 

Lemma B.l (Convergence of moments of 0^) Let = F^^UjA, and assume that F 

satisfies (11.18p . Furthermore, assume that F is a continuous distribution function with density 
f: [0, oo) — ► [0, oo), and that there exist an a > such that, as y J, 0, 

where f{x) = info< J/ < x f(y). Then, there exists an a > such that 



C F- l (u){F-\u)-F~ l (u N )) du = 0{N' a ), (B.7) 
J o 



and, for i = 1,2, 

-l 



/ 

Jo 



F-^uY - F- 1 (u N ) i du = 0(N~ a ). (B.8) 

'o 

The proof of Lemma IB. II is followed by an example of cases where the conditions are satisfied. 
Proof of Lemma IB.lt We first claim that (11.18P implies that for y G (0, 1), 

F-\y)<c h y-\ (B.9) 

where b = l/(r — 1). We show the claim (TB.9P by contradiction. Suppose that there is y £ (0, 1), 
satisfying F~ 1 (y) > c b y~ b . Observe that if F~ 1 (y) = w, then F(x) > y for each x < y, by definition 
ABU)) . Then with x = c b y~ b < ^(y) = w, 

F(x) >y = cx^ T , (B.10) 

which contradicts (|1.18p . This proves (IB.9|h 

From the three statements in ()B.7p and ()B.8p . we only prove (jB.8[) for i = 2, the other two 
claims are similar, and in fact easier to prove. When i = 2, we can rewrite and bound 

/ F- l {uf - F~ l (u N f &u <2 I F-V^P-V) -P- 1 (u JV ))dn, (B.ll) 
J o ■/ o 

where we have used the special product x 2 — y 2 = (x — y){x + y) and that F~ l (u N ) < P _1 (u)> for 
all u € (0, 1). We split the integral into u € (0, N~ d ) and u G [-/V~ rf , 1), for an appropriate d > 0. 
For the former, we bound using (|B.9P 

r-N~ d r N~ d pN~ d 

/ F-^u^F^^-F-^u^duK F- 1 {u) 2 du<c 2b U - 2b du = ©(jV^ 1 - 26 )) , 
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where we used that b = l/(r — 1) < 1/2, since r > 3. For the integral over u € [N , 1), we note 
by Taylor's Theorem that 

F-\u) - F-\u N ) = -(u- Un )±F~\u* n ) = , (B.12) 

du f(F 1 (u%)) 

for some u* N E [u, n^]. Since u i— > F~ 1 (u) is non-increasing, we have that F~ 1 (ii^ r ) < F (u), so 
that 

Ujv — it 1 

" JV/(F-i(u))' ( ' 13) 

Thus, we arrive at 

T F-'in^F-^u) - F-Hu N )) dn < ^- f 1 du = O {N~ 1+ad ) , (B.14) 

Jiv-d N J N - d [[F l (u)) 

where, in the last inequality, we made use of (|B.6j) . We conclude that 

f F~\u) 2 - F-\u N ) 2 du < 0{N- d ^- 2b ^) + 0{N ad - 1 ) = 0(N- a ), (B.15) 
Jo 

where a = min{d(l — 26), 1 — ad} > 0, provided we pick d > so small that 1 — ad > 0. □ 

We now present the promised example: Consider the class of those continuous distribution 
functions F satisfying (jl,18p for which f(x) = F'(x) is positive and decreasing on [0, oo). We shall 
prove that such F indeed satisfy (|B.6|) . We will use (|B.9|) to prove (|B.6P when f(x) = F'(x) is 
positive and decreasing on [0, oo). Then, for such /, f(x) = f(x). In this case, we note that 

_P-*(u) d - x 2 

2 f{F-\u)) = -Tu F (R16) 

so that, by (lET9|) with b = l/(r - 1), 

2 f y J^T^ ^ = F-\V? < c 2b y- 2b = 0(y- 2b ) , (B.17) 

so that (|B.6[) is satisfied. In particular, for 

F A (x) = x~ T+1 , x>l, (B.18) 

this proves the claim. 

We will now show that the conditions (C1)-(C3) do hold for the weights {Xi}fL l = F^ii/N). 
We start with validating the conditions (CI) and (C3), since for condition (C2) more effort is 
needed. 

Condition (CI): Combining (fL6j) with (fB~3j) we rewrite 

i N r 1 

VN = TT Y / F~ 1 (i/N)= F-'Mdu, (B.19) 
^ i=i Jo 

where as before u N = . Similarly, one can show that 

1 f 1 - 

v N = — / F- 1 (u N ) 2 du. (B.20) 
Jo 
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Therefore, we choose 



u = I F (u)du 
'o 



and 



fi Jo 



F^iufdu. 



(B.21) 



Effectively, we have replaced u N in (|B. 19[) and (|B.20p by u. Observe that the following also hold 
u = E[Q f ], n N =M[e ( p), v = E[& 2 f ]/E[@ f ] and v N = E[(0W] /E[9' w) ] . (B.22 



Condition (CI) is satisfied, since Lemma IB. H where we take i = 1, implies the existence of a 
constant a.% > such that 



|/^-/x|< / (tt) — F" 1 (u N ) | du < N~ ai . 

Jo 



(B.23) 



Using (1ET2"0|) and (IB~2T]) . we obtain that 



Wn - v\ 



— f 1 F- l {u N fdu-- f 1 F-\u) 2 du 
Jo H Jo 

1 



< 



1 

JO 



F- 1 (u N ) 2 - F- 1 (u, 



1 1 



du + | — - - ) / F- 1 {uf du. 



We bound the above display, using Lemma IB. II and ()B.23|) . by 



\Vn — V < 



1 



1 



1 



F -1 (u) 2 du = 0(N- ai ] 



since 



F" 1 {u) 2 du = E F" 1 (?7) 2 = E[0j] < oo, 
where ©/ = F -1 (U) is a random variable with distribution F satisfying (|1.18p . 
Condition (C3): Observe that, again using that u i— > -F~ 1 (u) is non-increasing, 

N 1 W 

E\r-l-e _ J_ 
i N 



(B.24) 



i=i t=i 



< 



E[F _1 (i7) 



l/Tn-r-l-el 



and F 1 (C7) is a random variable with distribution F which satisfies ()1.18p . compare with (|B.24|) , 
Therefore E[F _1 (C/) T_1 ~ e ] < oo, which in turn implies that 



1 N 

lim sup — >^ XJ 1 6 < oo. 

i=i 



Since, for fixed N, we have that F 1 (i/N) is non- increasing in i, we obtain from (|B.9|) . 



AN) 



max F-t-ti/N) = F _1 (l/iV) = C(7V 1/(1 - t) ) < iV 7 . 

Ki<N V y 



Condition (C2): We bound (IB.5D by Lemma lB.lt there exists a constant Q2 > such that 



d TV (Foz(0j Ar) ),Foi(O / )) <E F~ L (U) - F~ L [U h 



0(N- 
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Similarly as (|B.5|) . we obtain 

1 oo 



n=0 



<E[|G S -G^|]=E 

We claim that for two distribution functions G: R 
fl-(O) = 0, we have 



e 



(B.25) 



[0,1] and F: R i-> [0,1] with G(0) 



/•l /*oo 

/ |G~V) dw= / |G(x) dx. 



(B.26) 



To see this, note that when G(x) > H(x) for every x € R, so that G(x) — H(x) > 0, G _1 (u) — 
H~ l (u) > 0, then both sides of (|B.26|) are equal to ELY] — E[Y], where X and 1" have distribution 
functions G and if respectively. This proves the claim for G and H for which G(x) > H(x) for 
every x € R. For other G and -ff, we write, for x € [0, oo), 



so that 



(G V #)(x) = max{G(x), F(x)}, (G A #)(x) = min{G(x), H(x)}, 



G(x) -H(x)\dx = (G V H) (x) — (G AH) (x) dx. 



(B.27) 



(B.28) 



Then, both G A H and G V H are survival functions for which (G V H)(x) > (G A H){x) for every 
x 6 R. Thus, by the above proof, (1B.26|1 holds for them. The claim is then completed by noting 
that 

(GV^W =inf{s : max{G(s), if (s)} < u} = max{G _1 (n), (B.29) 



and 
so that 



(GAH)~ L (u) =inf{s : min{G(s), <u} = min{G _1 (u),fr _1 (u)} (B.30) 



(G V H)~ L (u) - (G A H)~ (u) = \G- L (u) - H- L (u)\. 



(B.31) 



This completes the proof of (jB.26j) . 

Using (|B.26p . we can rewrite display (|B.25P as 



d TV (P0i(0^),P0i(0g)) < 



du 



Fe 3 (x)-F (N) (x) dx. (B.32) 



Recall that \x = E[0y] and \i N = E 0^ , then, using Lemma lA. 11 the integrand can be rewritten 



as 



F e 3 (x) - F Q m (x 



E 



e, ef 

— 1 {G / >x} " " ^ 



< E 



F-\U) 



F-Mu 



1 {F~ 1 {U)>x} 



F-\U. 



fJ-N 



" 1 {F- 1 (t/jV>a!} 



/i ± {^- 1 H>4 ^ ■ L {F-i(n JV )>4 



du. 



(B.33) 
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Using (|B.32|) . (|B.33|) . and the triangle inequality twice, we obtain 



d TV (Poi(e ( g N) ),Poi(e g )) < 



1 


1 


poo p 




/% 


Jo Jo 



i*O0 pi 

+ — I / F-\u) 

fJ-N Jo JO 



1 



1 



1 {F- 1 (u)>x} ~ 1 {F~ 1 {u N )>x} 



du dx 



-{F-i(u N )>x} dudx - ( B - 34 ) 
Next, we will bound each double integral in display (|B.34p . The first double integral is finite, 



+ — / / \F- L (u) - F- L (u,r)\li 
Mjv Jo Jo 



since 



/"OO ^1 fl 

J F-\u)l {p _ 1{u)>x} dudx = F~ 1 (u) 2 du = E[Q}} <oo, (B.35) 

where we used in the last step that the distribution F satisfies (|1.18p . Observe that l|^-i( u ) >x | 
1 {F- 1 («iv)>a;} if and only if ^~ 1 (. u n) < x < F^ 1 ( u )? thus 



oo pi 



F-Hu) 



1 {F-^{u)>x} 1 {F~ 1 (u N )>x} 



dudx= / F- 1 (u)(F- 1 (u)- F- 1 ^)) du. 

Ju=0 1 Jo 

(B.36) 

Using (|B.7p it follows that the right hand side of (|F3.36|) is C(A r_a ). Finally, using again (|B.7|) . 



roo rl rl 

J J {F- 1 (u)-F- 1 (u N ))l {p „ 1{uN)>x} dudx = J {F-H^-F-'iu^F-h 



u N ) du 



< F-\u) [F-\u) -F-\u N )) du = 0(N- a ). (B.37) 
Jo 

Combining (|B735|1 - (|B737|) with ([B34]) . we obtain that 

d TV (Poi(e { g N) ),Poi(e g )) = o(- -—) + —o(N- a ) + — 0[N~ a ) = o(N- a *), 

fl fl N u N fl N 

where we used that \fi — u N \ = 0(N~" ai ") and take 03 = min{a,ai}. 

C Appendix: i.i.d. capacities 

In this section, we prove Theorem 11.41 by showing that the results of the Poissonian random graph 
with deterministic capacities also hold when the capacities are replaced by i.i.d. capacities satisfying 
(fTTHjh 

More precisely, we associate a sequence {Ai}^L 1 of positive i.i.d. random variables to the nodes, 
where the random capacities {Aj}^ =1 have distribution function F\(x) = P (A < x), and we set the 
capacity of node i equal to Aj, for 1 < i < N. 

Define 

J=J(a)=J 1 (a)Dj 2 (a)nJ 3 , (C.l) 

where 



Jl(a) = {\fi N - Ml < N~ a } n{\u N -u\< N- a } , 
J 2 (a) = {d TV (f N \f) < N~ a } n {d TV (9 <N \g) < N- a ] , 

and, for some B > sufficiently large, 

[n=0 J 



(C.2) 
(C.3) 



(C.4) 
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and where the exponent 7 is equal to l/(r — 1) +e (recall (|1.12p ). 
In Sections IC.lHC.3l below, we will show that there exist constants 



such that the events Ji(at\), ^2(02), J3 occur with probability exceeding 1 — 0( y N~ IBl ) , 1 — 0[N~^ 2 ) 
and 1 — C(A r_ ^ 3 ), respectively. Then, by taking a = min{ai,a2} and f3 = min{/3i, /32, /%} the 
event J {a) occurs with probability exceeding 1 — 0{N~^~). On the event Jiot) the conditions 
(C1)-(C3) do hold, and this then proves Theorem 11.41 

For the sake of completeness, we give the counterparts of (I1.6P and (11. 8p using the random 
capacities. Display (jl.6p becomes 



^X> and ^ = _i_^A|, (C.5) 



i=l 



Using the law of large number the values fi N and v N converge a.s. to \i and v in (|1.17p . respectively. 
The offspring distribution of the first and second generation is given by (compare (|1.8|) ). 

which, by the strong law of large numbers, converge a.s. to / and g in (|1.17p . respectively. 
C.l Convergence of means (CI): 

In this section we will show that condition (CI) holds whp, i.e, that there exist constants a±, (3\ > 
such that the event 

Jliai) = {\ns-iA <N- ai }n{\u N -u\ <N- ai } 

occurs with probability exceeding 1 — ©(iV"^ 1 ), where /i and v are defined in (|1.17p . 
The next lemma is the crucial estimate of the proof: 

Lemma C.l Let q 6 (0, r — 1) and define 

1 N 

S N , q = - ^ A? and S Ntq (u) = {\S N , q - E[A*]| < N~ u } , 
i=i 

then there exist constants u, v > such that 

F(S N>q (u)) = l-0(N~ v ). (C.7) 

Proof. Apply the Marcienkiewicz-Zygmund inequality [12], with = A^ — E[A|] and r £ (1,2), 
such that qr < r — 1. Then, 

P(s„,,(«)«) = P||^y>,| >iv-"| =r| |iVx,f >at 



(lif>|>"-")=^|> 



since E[|Xi| r ] < E[|Ai| ?r ] < 00. Now recall that r > 1 and fix u > so small that v = r(l — u) — 1 > 
0. □ 
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Using Lemma lC.lt we pick the constants u, v > such that 

w(s N)1 {u)ns N , 2 {u)) = i-o(N- v ). 

Observe that (jl n = S N] % and u N = S N) 2/S N> i. On the event S N) i(u) PI S N) 2(u), we have that 

= \S N , 1 -E[A)\ = 0(N- u ), 



and 



Wn - v\ 



S N , 2 E[A 2 



S N ,i E[A] 



< 5 



iv,2 



1 1 



^,1 E[A] 



E[A] 



E[A] 5 



iV,l 



(e [a 2 ] +o(at- m )) o(iv- M ) 



E[A] (E[A] - 0(N- U )) 
Therefore, we can pick a\ = u and (3\ = v. 



+ 0(N~ U ) = 0(N- U ). 



(Ci 



□ 



C.2 Convergence of branching processes (C2): 

In this section we will show that there exist constants c*2 , /?2 > such that 

P(d TV ( 9 (Ar) , 5 ) > AT-" 2 ) = 0(N~fc). 



(C.J 



The proof of the existence of constants a 2 ,/?2 > such that P(d TV (/ (Ar) , /) > N~ a2 ) = 0(N~ 132 ) 
is less difficult and therefore omitted. 

In this section, we add a subscript N to P and E, to make clear that we condition on the i.i.d. 
capacities {Aj}^ 1 . Thus, 



r(-)=F(-\A 1 ,...,A N ) and E w [-] = E[- |Ai, . . . , Ajv] . 



(CIO) 



Using Lemma lA.ll with {/n } n >o and {f n }n>o, it follows that = Pv(-Po2(©e/ V) ) = n) 



where, for i = 1, 2, . . . , N, 



P w (0W=Ai) 



A, 



and g n = P (Poi(@ g ) = n), where @ g is given by 



Fe g (x) = -E[A1 {A < X} ] 
I- 1 

(N) 



(C.ll) 



Let U uniformly distributed over (0, 1), then we couple and Poi{Q g ) by letting Q g = F Q \U) 
and Q g N) = F~^ N) (U). Arguing as in (|B.5p . we find that, 

d TV (g {N \g) = d TV (Poi(ew) ,Poi(e fl )) < E N [\e<p - e g \] . 

Then, using the coupling and (|B.26j) . we can write 



d TV (g {N \g) <E N [\QW-Q g \] =E f 



/ 

Jo 



where i*e„ is given by (lCTTT|) and F w by 



du- 



N 



F e (m(x) ~ Fe g (x) dx, (C.12 



F^(N) (x 



2 = 1 



(C.13) 
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Before we prove (|C.9|) . we state two claims. The proofs of these claims are deferred to the end 
of this section. The first claim is that 



d TV (g {N) ,g)< 



1 



VN + 



E N 
A* 



where 



E N = —)^Ajl{ Az>x} -E[Alj A>a .) 
J ° iV i=l 



dx. 



Secondly, we claim that there exist constants a, ft > such that 

¥(E N > N~ a ) = 0(N~ 13 ). 



(C.14) 



(C.15) 



(C.16) 



We finish the proof of (|C.9|) . given the claims (|C.14|) and (|C.16p . Indeed, from Section fC.il we 
know that there exist constants ai,j3\ > such that P (<Ji(oti)) > 1 — C(A r ~^ 1 ). 
Combining ([Cl4"l) and ([ClB]) . we can bound the total variation by 



(d TV (g m ,g) > N- a *) < N a2 E 



U N + E N / [l J l{B N <JV-«}nJi(ai) 



+ P(E N > N~ a ) +0(N- Pl ) = (iv- min W-«2,«-«2,/3,/3i}). ( C _ 17 ) 

Pick a2 £ (0, min{ai, a}), then the right-hand side in the above display tends to zero as N tends 
to infinity, and (ET9l) follows, when (ICTTIt) and (ICTTBl) hold. □ 

Finally, we finish the proof by showing, in turn, the claims (|C.14|) and (|C.16|) . 

Proof of ([CH4|h Using (jOTTI) . (^IT2l) and §gJM we bound d TV {g {N \g) by 



roc 1 1 

d TV (g <N \g) = / jj— ^ Ail {Ai>x} - -E[Al {A>x] 

f°° 1 A 
+ - J Q nY, A ^>-} -E[A1 { a>,}] 



d.r 



1 

where in the last step we used that 
This proves (fCTTII) . 

Proof of (|C.16jl : For a > 1, we define the density f a by 

f / \ _ a ~ 1 
MXJ " (x + l)«" 



(C.18) 



□ 



(C.19) 
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Then, for any 5, a > 1, by subsequently using the Markov inequality and then Jensen's inequality, 



{E N > N- 



E S N > N- aS 



< N aS (a - l)- d E 



:i + x) a 



i N 

— ^2Ail {Ai>x} - E[Al {A>x} ] 



1=1 



(C.20) 

\ S~ 



fa(x) dx 



< JV E 



N 



aS 



l (l + x) a ^ -J>l {Ai>:c} -E[Al {A>:c} ] 

/•oo 

/ (l + x) a(5 - 1} E 

J 



1 " 

-^A,l {Ai>ffl} -E[M {A>a;} ] 



dx. 



In order to bound (]C.20|> . we use the Marcinkiewicz-Zygmund inequality [12, Chapter 3; Corollary 
8.2] on the variables Xi(x) = Ail^ A . >x y, 1 < i < N. Then we can bound (|C.20|) from above by 



E M > N~ 



/•oo 

*) < 0(N aS ) / (1 + x) a( ^ 1) E 
Jo 

roc r 

<0{N a5+1 - & ) J (I + x)< s -^¥]\X\ 



1 N 

-Y^X t (x)-E[X(x)} 



dx 



x)-E[X(x)}\ 



dx. 



Since for any x > 0, 



E 



X(x)-E[X(x)]\ s <E X(x) s = / y s dF A (y), 



(C.21) 



(C.22) 



we obtain 



(l + xf^E -E[X(x)}\ 5 dx < / (l + x)^ 1 ) / y s dF A (y)dx 



x=0 
oo 



y=x 



poo 

< / y 5 (l + y)^-^ 1 dF A (y) < oo, (C.23) 
Jo 



when we first pick a > 1 and next 5 > 1 so small that 5 + a(5 — 1) < r — 2. 
Combining (|C.2ip and (|C.23p . we obtain 

F(E N > N- a ) = 0(N aS+1 - s ) = 0(N~ ( - 5( - 1 - a ^), 



(C.24) 



The right hand side of (|C.24p . tends to zero if 5(1 — a) — 1 > 0, which is the case if we choose a 
small and 5 slightly bigger than 1. □ 

C.3 Moment and maximal bound on capacities (C3): 

In this subsection we show that P (J 3 ) = 1 - O(N-fo) . Observe, that for every £ > 0, using Boole's 
inequality, 

N 

i=i 

If £ > l/(r — 1) then the right hand side in the above display tends to zero. Remember that 
7 = 1/ (r — 1) + e, therefore we have 

P(A#° > A r7 ) = 0(N- iT -^ £ ). 
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This proves the bound on the maximal capacity in (C3). 

Apply Lemma IC .11 with q = r — 1 — e, which states that there exist constants u, v > such that 



l-P(5 w , T _i_ e («)) 



/ 1 N 



> at-« = 0(AT-«). 



Furthermore, (jA.7|) in the proof in Lemma I A. 21 shows that there exist constants C±,C2 < oo such 
that, uniformly in {Aj}^, 



E^flg < + C 2 1^ AT 1 -. (C.25) 

n=0 i=l 

Therefore, for A r sufficiently large and B > Ci + C 2 E[A r_1_£ ] , 

1 - P ( Jz) < P (Ag° > iV 7 ) + P ( { f; n r - 2 - £ 5 W > B I n 5 W)T _!_ 6 («) ] + 0(A^) 



\ kn=0 J / 

< ©(AT^" 1 ) 6 ) + + 0(Ar") = o^-mMO"-!)^}^ (C.26) 
Thus, we may take f3% = min{(r — l)e,u}. 

D Appendix: Coupling of shortest path graphs to branching pro- 
cesses 

In this appendix, we prove Proposition 13.21 in Section ID. II and Proposition 13.41 in Section ID. 21 
D.l Proof of Proposition 

In this part of the appendix we give the main result on the coupling between the capacities of the 
NR-process and those of the NR-process. For convenience we restate Proposition 13.21 as: 

Proposition D.l There exist constants r],ui,Vi > 0, such that for all t < (1/2 + rj) log„N, 

P (^p(C k -C^) > N^ 2 - Ul ^j < N- Vl . (D.l) 

We start with an outline of the proof Proposition ID. II 

The difference between C k and stems from individuals, whose mark has appeared previously 
and which is consequently discarded together with all its descendants. Call such an individual a 
duplicate. To show the claim (jD.ip we need some more details of the thinning procedure of the 
NR-process {Z t , M t }t>o. We relabel the marks of the NR-process 

given in Section 13.11 as 

M , M 1 ,M 2 , • • • , A% x , A% 1+1 , . . . (D.2) 

By definition, Mq is a random variable uniformly chosen from {1, 2, . . . , N} and the marks M v , for 
v > 0, are independent copies of the random variable M given by (|3.2p . Introduce the random 
variable Y v , such that Y v = 1 if M v is a duplicate and Y v = otherwise, so that 

Yv = 1 u^- 1 {M„=M w }' ( D - 3 ) 
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and denote by s(v), v > 0, the generation of individual v in the NR-process. Let Dup 4 be the set 
of all the duplicates in the first t generations of the NR-process, so that 



Dup t = {v > 1 : Y v = 1 and s(v) <t}, t>0. 



(D.4) 



Consider the subtree with root the duplicate d, d E Dup t . The NR-process is a marked BP, 
therefore the subtree with root d is also a marked BP and we denote this subtree by \Z\ , M} }t>o- 
Observe that the mark of a duplicate, which we denote by the random variable M^Y^ = 1, depends 
on the marks seen previously; the distribution of the mark of a duplicate is biased, because of the 
duplication. As a consequence also the offspring distribution of the first generation of {Z^} t >o 
differs and we will denote this distribution by 

/« = P (Poi(\(M d )) = n\Y d = 1) , n > 0, 



(D.5) 



where A(o) = A a for any a. In second and further generations the marks are independent copies 
of the mark M with distribution given by (|3.2|) and the offspring are independent with common 
distribution {gn} n >o, defined in (|3.7p . Therefore, denoting by 



(M#,M$,...,M<%), k>0, 



the vector of marks of the individuals in generation k of the subtree with root d, we have for k = 0, 



that M 0jl has distribution M^|Yd = 1, whereas for k > 1, all the marks M fe • are independent with 
common distribution given by the right side of (|3.2I) , 
By construction, 



fc=0 

and, similarly, 



t-s(d) t-s(d) 

< E E ^' = id^i+ E E 

dSDup t k=0 <ieDup t k=l 



(D.6) 



E(^-^)< E + E E E A «U 

fc=0 deDup t dGDupi k=l m=l 



(D.7) 



Suppose that we can find an event T>, with P(2? c ) = 0(A~''/ 2 ), such that the event T> ensures 
that there are only a few mismatches between the NR-process and NR -process. Then the Markov 
inequality, yields: 



\k=\ 



E(^ 



CM 



V 



k=l 



+ 0{N~ 11 ' 2 ). 



(Di 



According to (TD.8P it is hence sufficient to show that 



E 



.k=l 



0(N 1/2 ~ Ul ~ Vl ). 



(D.9) 



The capacity of each duplicate is bounded from above by A 7 , as a consequence of (C3). Fur- 
thermore, as the marks in the second and further generations of a duplicate are independent copies 
of the random variable M, and E[Aw] = v N , the expected value of (|D.9P becomes 



E 



E(^ 



c k )i 



p 



k=l 



< A 7 E[|Dup t |lo] + j/jvE 



t-s(d) 

E E^ 

d£Dup t k=l 



(D.10) 
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Using auxiliary lemmas we will show that 

E[|Dup t |l c ] = 0{N 1 ' 2 - p -^) and E 



t-s(d) 

E E^ 

efeDup t k=l 



0(N x l 2 -P), 



(D.ll) 



for some p > 0. Combining (|D.10p and (|D.ll j) yields (|D.9p . We end the outline with a list of all 
statements that we intend to prove and which together imply the statement in Proposition ID.1L 
The list consists of the following steps: 

1. Define V and show that ¥ (V c ) < N^l 2 , 

2. Show the two statements in (ID. 1 1 h . 

To control the size of the set Dup t we will use two lemmas. 
Lemma D. 2 For rj,S £ (—1/2, 1/2) and all t < (1/2 + 77) log,, N, 

/ * \ 



Y^Z k > iV 1/2+5 = 0((log v N)N-( s -ri). 



\k=0 



Consequently, if {Mt}t>o 

are the reachable sets of an uniformly chosen node A in the graph Gn, 

then 

Vt| > N^ 2+5 ^ = 0{Qog u N)N-V- ri >). 



Proof. From (CI), and for k < c\og u N, 



0(N C ), 



(D.12) 



Taking c = \ + 77, and using ([TT]) and (|D.12jt . we find for i < (1/2 + 77) log,, iV, 
\k=0 / k=0 k=0 

t 

= N -i/2-s^2 0(N l / 2+r >) = 0((log u N)N^- s ). 

k=0 

Using the coupling (|3.5|) . 

p (\Mt\ > iV 1/2+<5 ) = p [E^ > Nl/2+S ) ^ p ( E^ fc > ^ 1/2+<5 ) = C((log^iV)iV^' 5 ) 



(D.13) 



\k=0 



\k=0 



□ 



Lemma D.3 For each 5 € (-1/2, 1/2), u > and v < N l / 2+& , 

p ( E Yw ^ ^ ) = o(^ (u_2,5) ) 



(D.14) 



V«i)=l 



Proof. From the definition of Y v , see (|D.3|) . and Boole's inequality. 

v-l 

P(Y v = l\M v = m) = ¥ (u v -Jo {M w = m}) < ^ P (M w = 



tu=0 



^ 1 f 
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Therefore, using v < N l l 2+& , we have that 

t N , N 

P (F„ = 1 



^ N \ N A 2 i 

T7 2^7- + (^-l)2^7^ = ^ + 



iV ^ l N 

m=l 



A 2 n _l , U — 1 H N ^ 1 fi 



N i/. 



<± + E» N 8-l/2 = (N S - 1 / 2 ). 

N v ' 



m=l N 

From the Markov inequality, and since v < N l l 2+& , we find 



^ F tt > AT« J < AT-" ^2P(Y W = 1) = {vN & - l / 2 -' a ) = O (N 2S ~ U ) . 



\w=l 



w=l 



□ 



Proof of Proposition ID. 11 Define the event V = T>\ n T>2 D £>3, where 

Pi = {Dup L{1/ 2_ 2r))logy7 vj = 0} , (D.15) 

P 2 = {|Dup L(1/2+r?)log!/iV j| < iV 5 "} , (D.16) 

V 3 = P| {zf<iV e }, (D.17) 

and where = 677 + 7. 

It remains to prove that P(£> c ) = 0(iV _?? / 2 ) (Step 1), with T> as defined above, and the two 
statements in (jD.lip (Step 2). 

We start with the first statement in (jD.lip , We have, 



E[|Dup t |lx>] < E |Dup 



>L(l/2+ n ) log.TVJ 



< N 5V < ATV2-P-7 



(D.18) 



by choosing r/ and p small and the fact that 7 £ (l/(r — 1), 1/2), c.f. (I1.12|) . 

The second statement in (jD.lip proceeds in the following way. On the event T>\ n T>2 duplicates 
do not appear in the first [(1/2 — 2r\) log„ iVj generations, implying that on T>\ n T>2, 



max (t - s(d)) < t- [(1/2 - 2r))log u N\ < 3rjlog„N, 

d€Dup t 



since t < (1/2 + rj) log,, N. Thus 
E 



t-s(d) 

E E4^ 

ci€Dup t fc=l 



< E 



L3"log,iVJ 

E E ^ 

(iGDupj fc=l 



(D.19) 



(D.20) 



The right side is the expected size of the progeny of the duplicates and their offspring. The 
expectation of the total number of children of all the duplicates on P3 is bounded from above by 
the expectation of the total number of duplicates times N 8 . Since on T>2 the total expected number 
of duplicates is bounded from above by N 5l] , the total number of children of all the duplicates is 
bounded from above by N 5v+e on T>2 C\T>s. Since, the mark of a child of a duplicate is by definition 
an independent copy of the random variable M, the offspring distribution of each child of a duplicate 
is an independent copy of {Z/};> , where {Zi}i> is a BP with Zq = 1 and where each individual 
has offspring distribution {g n }n>0- Therefore, the right side of (jD.20l) can be bounded by 



E 



t-s{d) 

E E 

deDup t k=l 



i37?log„JVJ 

E ^ 

fe=i 



Using dEUD, 



E 



E 

k=l 



L3n(log„ TV) J L3^(l°g„ N)\ 



E 

k=l 



0(N 3ri ) = 0((log u N)N 3l i) = 0(N 471 ). 



k=l 



32 



Thus, we can bound (|D.20|) by 



E 



t-s(d) 

E E^ d)1 - 

dGDup t k=l 



< 



which is sufficient for the second statement of (|D,llj) . since 7 < 1/2 and we can pick r\ arbitrarily 
small. This completes the proof of Step 2. 

We continue with the proof of Step 1. For this, we bound 



\v c ) < p(i>f) + p(z>§) + p(©3 n v 2 ). 



(D.21) 



We now bound each of these terms one by one. For E(T>\), we use Lemma ID . 2 1 and Lemma |P.3l to 
obtain 

L(l/2-2f,) log, JVJ \ \ 

i v\ ) < ? 1 ^ i 1 < E Zfc < Nl/2 ~ v \ J + (( lo s- ^0^") 



< 



'LATl/2-»7j 

y w > 1 + o(n-^ 2 ) = o{n^' 2 ), 



(D.22) 



UI = 1 



and, similarly, 



P(X»§) < P £>£n < 



L(l/2+J?) log, JVJ 

]T Z fe < JVVa+*i }> I + o((log u N)N- r >) 

k=0 



< 



'LATl/2+27)j 

Y Y w > N 5r > I + ©(iV^/ 2 ) = C(jV-" -l 



(D.23) 



u>=l 



Using the Markov inequality and Boole's inequality we have that 



E ^ 



p 2 



cJeDup fc * 



(D.24) 



where, for convenience, we set k* = [(1/2 + rj) log u N\ . If we condition on the sequence {M v } v >q 
then Z[ d \ for d £ Dup fc *, is an independent Poisson random variable with mean \(M<i). Thus, 
conditioned on the sequence {M v } v >q, the sum X^eDup fc , ^ s distributed as a Poisson random 
variable with mean ^deDup,., ^d- I n turn this sum can be stochastically bounded from above by 
a Poisson random variable with mean |Dup fc , |iV 7 , because A(M^) < < jV 7 for all d G Dup fc , 
by (C3). Since T>2 and Dup fc » are measurable with respect to {M v } v >q, the above yields 



E 



E z ^ 

dgDup fc * 



E 



E[Poi(Z[ d) )\{M v } v > ]l V2 

d£Dup fe * 



E 



E 



{M v } v 



>o 



LP 2 



/••• I 2 

K d£Dup k , 

< E[E[Poi(|Dup fc «|jV 7 )| {M v } v > ] 1 V2 ] 

< E[E[Poi(iV 5,?+r )| {M v } v >q\] = E[Poi(N 5ri+ ~f)] = jV 5j ?+ 7 . 



(D.25) 
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Thus, we bound ([TIM by 



P (2?| n V 2 ) < n- 6 e 



E 

dgDup fe * 



(D.26) 



Hence, by substituting (|ET22]) . (|DT23l> and (1TX261) in (fTX2i~j) we obtain, 

P(P C ) = C(iV^/ 2 ), 
which proves Step 1 and therefore the proposition. 



□ 



D.2 Proof of Proposition 13.41 

In this section we prove Proposition 13.41 which we restate here for convenience as Proposition ID.4[ 

Proposition D.4 There exist constants 03,(33,7] > such that for all t < (1 + 2r])log v N, as 
N -> 00, 



t+i 



t+i 



k=2 k=2 

We start with an outline of the proof. Define 

1 t+i 1 t+l 

1 ~ ]v Lfe/2j i^rfe/ail ana -t2 - ^ 2^— rfe/21 l^/2j ~^ 



(D.27) 



iV 



(2) 

lfe/2j | 



(D.28) 



fc=2 k=2 

To show (|D~27j) . it suffices to prove that for an appropriate chosen event 7i, 



T i l H >\N~ a A =0{N-&), i = 1,2, 



(D.29) 



and that P (H c ) = 0{N for some £ > 0. We choose the event TL such that on this event, for 
each k < t < (1 + 2rj) log^ N, 



E[Z$ /2i l H ]=0(N^), 



E[\Z^-C!^\i n }= (N^) : 



E[C$ m l H ] = O(N^) and E[|2» aJ - Cg m | 1„] = ©(iV 1 /^), 
for some u,a3,r/ > such that 

«3 < u — 2ry. 



(D.30) 
(D.31) 



(D.32) 



The claims (jTX30jh (jD~3T|) and (TfX32|) imply (TTX29]) . This is straightforward from the Markov 
inequality, the independence of between Z (2) and the associated capacities and the inequality 
t < (1 + 277) log„ iV. We leave it to the reader to verify this. 

Instead of showing (|D.30|) and (|D.31|) . it is sufficient to show that there exist constants as,r],u > 
satisfying (|D.32j) and such that for each k < t < (1/2 + if) \og v N, 



M z k ~ G*\ lw] = 0(N 1 / 2 - u ), E[Z k l n ] = 0{N 1 l 2+2 ^) and E\C*1 H ] = 0(N 1 / 2+2 ^), 



(D.33) 



which simplifies the notation considerably. 
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We choose H = Hi PI Hi n H3, where 

(D.34) 

H 2 = ft =1 H 2 ,k, (D.35) 
H 3 = r\l =1 n 3>k . (D.36) 

Hence, for the proof of Proposition ID. 41 it suffices to show the three claims in (|D.33p together with 
u — 2n > 0, and that for some £ > we have that P (H c ) = G(N~^ . For the latter statement, we 
use the following lemmas. In the first lemma we denote by d N a random variable with distribution 
{gn^n^o given by (pTT|) . 

Lemma D.5 (Bound the variance of the marked capacity) If the conditions (ClJ-(C3j do 
hold then 

Var(\ M ) = O(N^) and Var(d N ) = 0(N" f ), (D.37) 

where 7 was introduced in (|1.12p . 

Lemma D.6 (Strong convergence of {g^ln^o to {g n }n>o) Assume that conditions given by 
^ClJ-^C3j do hold. Then, there exists an a > such that 

00 

J2n\9 ( n } -9n\ = 0(N- a ). (D.38) 

n=0 

The proofs of the above lemmas are deferred to the end of this section. We start with the proof of 
Proposition ID.4} given the Lemmas ID.5HD.6l 

Proof of Proposition ID. 41 We have to prove that the three claims of (|D.33|) hold, with u—2rj > 0. 
Consider the first claim given by (ID. 331) . Using the triangle inequality we arrive at 

E[\C^-Z k \l H ] <E[|C,-C fe |l w ] +E[\C k -v N Z k _ 1 \l H ) 

+ E[\Z k _ 1 (v N -v)\l H ] +vE[\Z k _ 1 -Z k _ 1 \ ln ] + E[\uZ k _ 1 -Z k \l H ]. (D.39) 

The first, second and the last term on the right side of (|D.39|) . we bound by tv 1 / 2-111111 ^'" 1 ^, using 
the events Hi, H2 and H3, respectively. We bound the third term of (|D.39jl using (|D.12j) . which 
gives 

EQ^K -"){]< N-^ElZ^} = N-^v^ 1 = 0{N l l 2+ ^- 5 ). (D.40) 

Finally, we need to bound the fourth term of (|D.39p . The following inequality holds for each k < t 
(compare with [13 (A. 1.4) and (A.1.15)]), 

t 

E[\Z k -Z k \) < maxjt/ - a N ,v N - a N }^E[Z t ] (maxj^^})''' , (D-41) 

1=1 

where 

00 

a N = ^nmin{5 n ,t/<f }. 

n=0 



H 2 , k = {\C k -u N Z k _ 1 \ <AT 1 / 2 - 5 }, 
Hz ]k = {\Z k -vZ k _ 1 \ <N l l 2 - & Y 
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We bound the sum in ([KIT]) , using (JTZD and (|D~T2j) : 



y^E[Zj] (max {u, v N }) 1 1 = yi/J-N^N 1 (max-jV, t^y})* ' < imax {//, /x^} (max {z/, ^iv})' 1 



/=i 



i=i 



< t (l + 0{N-^)) t O(N 1 ' 2 ^) < N 1 / 2 ^, 



for N sufficiently large. Furthermore from Lemma ID. 61 

oo 

max {v - a N , v N - a N } < ^ n \g n - \ < N~ 



(D.42) 



(D.43) 



n=l 



(D.44) 



for some a\ > 0. Combining (|D.41j) . (|D.42j) and (|D.43p . we obtain that, 

E[\Z k - Z k \] = o{N l > 2+2r i- ai ). 
All together, the left side of (|D.39|) satisfies 

- Z k\ 1«] = 0(Ar 1 /2-min{5,« 1 ,5-^,a 1 -2^}^ 

This yields the first claim in (|D.33|) with 

u = min {5, u% , S — rj, a± — 2rj}. 
The second claim of ()D.33|) is evident, because k < t < (1/2 + rj) log u N and therefore 

E[Z k l n ] < E[Z k ] = ^ < ^v 1 - 1 = 0{N 1 / 2+ ^) . (D.45) 
Finally, the third claim of (|D.33p follows from using that < C h , 



E[C fc l H ] < E[CJ < E[C k ] = E 



Y KM*-*,*) 



v=l 



Now by taking conditional expectation with respect to Z k _ x and the capacities, we obtain 



E 



v=l 



Z k -i 



Z k -i 



E[A(M fc _ 1 ,„)| Z k _,] = Z^EIXm] = v N Z k _ u 



v=l 



so that (|D.12|) implies that 
E[C^1 H ] < E 



E 



%k-l 

^(Mk-i, v ) 



v=l 



Z k ., 



E[u N Z k _ 1 ]=0[N^). 



Thus, we have shown the claims given by ()D.33|) if we restrict rj to < rj < min{5, ai/2} due to 
(|D.44|) . To satisfy condition (|D,32p . we restrict rj to 



< rj < min{ai/4, ui/2,5/2}, 
and pick = (u — 2rj)/2. Then > and condition (|D,32p is satisfied, because 

u — rj < 03 = (u — 2rj)/2 = min{ai — Arj, u% — 2r/, 5 — 2r)}/2, 
for each < 5 < 1/2, and for each r\ given by (|D,46p . 



(D.46) 
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We finish the proof by showing that P (W c ) = 0(N for some £ > 0. A simple bound is 



'(H c )<^P( W f). 



(D.47) 



i=l 



We bound P (Til) using Proposition ID, 11 which states that there exist constants u%, v\ such that 

F(Hi) = 0(N~ V1 ). (D.48) 

Next, we will bound P (H^) an d P CW§) from above. Using Chebychev's inequality, we will show 
that 



p = 0(i V - 1 /2+2«5+7+2^ ; fo r A; < t < (1/2 + 7?) log,, N. 

Then by Boole's inequality we have, 



(D.49) 



P(^) < E P (^2,fc) = C(tiV- 1 /2+25+7+2^ = 0((l ogi/ iV)Ar-(l/2-25-7-2r ; )^ ^ ^ 

fc=0 

Fix k, 1 < k < t < (1/2 + ??) log^ iV, and denote by V v = \(M k _ liV ) - u N for v = 1, 2, . . . , Z fc _ x . 

We have that Efl4] = 0, and that, conditionally on Z k _ 1 , the sequence {V v } v =i is an i.i.d. sequence. 
Hence, using Lemma lD.51 



E 



2> 

v=l 



Zk-i 



Zk-l Zk-l Zk-i Zk-1 Zk-i 

E = E E E ^ E ^ + E E t^ 2 ] 

11=1 w = l,W^V 



v=l w=l 
Zk-1 



v=l 



= E [K 2 ] = -Zfc-iVar (A(M)) = 0(iV 7 Z fc _^ 
«=l 

Therefore, and since by ([DT2]) . E[Z fc _J = = 0(iV 1/2+ ' 7 ), 



(D.51) 



Var K 

y u=i 

Thus, by the Chebyshev inequality, 













E 


E 



















> jV 1 ^ < jv 2a_1 Vax E M = 



1/2+25+7+^ 



Similarly, we can show that 



(Hl k )=0{(log u N)N 



v=l 



-(1/2-25-7- 



when we replace Z k ^ 1 by Z k _ 1 , set U n = ATfc_! n — E[djv], where Xfc-i,n is an independent copy of 
d N . Using Lemma TP. 21 and (jD.45|) . this yields 



















E^ 




E 


[(H 








u=i 













iV^E^Var (d w )] = iV 2<5 - 1 Var(d JV )E[^_ 1 ] = 0(^25-1/2+^+7). 
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The above yields, compare with (|D.50|) . 

F(H$) =o(N- {1 / 2 - 2S -~<- 2r iy). (D.52) 
Combining (jH38l> . (|D30|) . ([1X521) with (ITI471) gives 

p(H c ) = 0(i\r*), 

where £ = min{(l/2 — 25 — 7 — 2r/),ui}. Remember that 1/2 — 7 > 0, that 5 is restricted to 
< 5 < 1/2 and rj is restricted by (|D.46|) . So, pick rj, 5 > such that 25 + rj < 1/2 — 7, because we 
can pick 5 > and 77 > arbitrary small, then £ > 0. □ 

Proof of Lemma ID.51 Consider the first claim of (jD.37j) . The variance of a random variable is 
bounded from above by its second moment. Therefore, using (|3.2p . (jl,7p and (jl.lip . 

Var (Am) < E[A M ] = - £ A*> < f - £ X 2 m < N^u N = („ + O(iV^)) = O(iV^) . 

m=l m=l 

We turn to the second claim of (|D.37p . Using ()A.7[) . where we take q = 2, and (C3) we bound 

00 1 N ,(jv) -. N 

Var (^) < £ n 2 <T = c + -f-I £ A? < c + £ A? = c + erf = O(iV^) . 

n=l ^ iV i=l iV . =l 

□ 

Proof of Lemma ID. 6t By Fatou's Lemma, as well as the convergence of hm^v^oo gn = g n which 
follows from (C2), we have 

00 00 00 

y^n q g n = Vn^liminf^ < liminf V n q g^° < 00, (D.53) 

n=0 n=0 n=0 

for g < r — 2 by Lemma I A. 21 combined with (C3). 
Then, we split, for ( > 0, 

00 L^ C J 00 

X>|<?r ~9n\ < £ H^f 5 -5n| + £ n| 5 r -ffnl- (D.54) 

n=0 n=0 n=|ATfJ+l 

Using (C2), we bound 

LATCj x 

£ n|flW " 9n\ < [N<\ £ - g n \ < N^. (D.55) 

n=0 n=0 

Using (C3) as well as (|D.54p . we further obtain for e > 0, 

00 00 

£ n \9 ( n N) ~ 9n\ < iV-?(-3- £ ) J- n T - 2 - £ (g^+g n ) = 0(N-^~ 3 ^). (D.56) 
n=[N<] n=[N(\ 

Fix £ < /? and e € (0,r — 3), which is possible since r > 3, then the claim follows with a = 
min{/3-C,C(^-3-e)}. □ 
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